A Morpheme-based Method to Chinese Sentence-Level Sentiment Classification
نویسندگان
چکیده
Sentiment classification is a fundamental task in opinion mining. However, most existing systems require a sentiment lexicon to guide sentiment classification, which inevitably suffer from the problem of unknown words. In this paper, we present a morpheme-based fine-to-coarse strategy for Chinese sentence-level sentiment classification. To approach this, we first employ morphological productivity to extract sentiment morphemes from a sentiment dictionary and to calculate their polarity intensity at the same time. Then, we apply the acquired morpheme-level sentiment information to predict the semantic orientation of sentiment words and phrases within an opinionated sentence. Finally, we determine sentence-level semantic orientation by combining all the sentiment phrases and their relevant polarity scores. The experimental results on NTCIR-6 OAPT data set show our system can achieve state-of-the-art performance.
منابع مشابه
Chinese Sentence-Level Sentiment Classification Based on Fuzzy Sets
This paper presents a fuzzy set theory based approach to Chinese sentence-level sentiment classification. Compared with traditional topic-based text classification techniques, the fuzzy set theory provides a straightforward way to model the intrinsic fuzziness between sentiment polarity classes. To approach fuzzy sentiment classification, we first propose a fine-to-coarse strategy to estimate s...
متن کاملRadical-Based Hierarchical Embeddings for Chinese Sentiment Analysis at Sentence Level
Text representation in Chinese sentiment analysis is usually working at word or character level. In this paper, we prove that radical-level processing could greatly improve sentiment classification performance. In particular, we propose two types of Chinese radical-based hierarchical embeddings. The embeddings incorporate not only semantics at radical and character level, but also sentiment inf...
متن کاملWEMOTE - Word Embedding based Minority Oversampling Technique for Imbalanced Emotion and Sentiment Classification
Imbalanced training data always puzzles the supervised learning based emotion and sentiment classification. Several existing research showed that data sparseness and small disjuncts are the two major factors affecting the classification. Target to these two problems, this paper presents a word embedding based oversampling method. Firstly, a large-scale text corpus is used to train a continuous ...
متن کاملMorpheme-Enhanced Spectral Word Embedding
Traditional word embedding models only learn word-level semantic information from corpus while neglect the valuable semantic information of words’ internal structures such as morphemes. To address this problem, the goal of this paper is to exploit the morphological information to enhance the quality of word embeddings. Based on spectral method, we propose two word embedding models: Morpheme on ...
متن کاملClassifying Attitude by Topic Aspect for English and Chinese Document Collections
Title: Classifying Attitude by Topic Aspect for English and Chinese Document Collections Yejun Wu, Doctor of Philosophy, 2008 Dissertation directed by: Professor Douglas W. Oard College of Information Studies & Institute for Advanced Computer Studies, UMCP The goal of this dissertation is to explore the design of tools to help users make sense of subjective information in English and Chinese by...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Int. J. of Asian Lang. Proc.
دوره 21 شماره
صفحات -
تاریخ انتشار 2011